• The GDPR and Unstructured Data: Is Anonymisation Possible? 

      Weitzenboeck, Emily Mary; Lison, Pierre; Cyndecka, Malgorzata Agnieszka; Langford, Malcolm (Journal article; Peer reviewed, 2022)
      Much of the legal and technical literature on data anonymization has focused on structured data such as tables. However, unstructured data such as text documents or images are far more common, and the legal requirements ...
    • Identifying Token-Level Dialectal Features in Social Media 

      Barnes, Jeremy Claude; Touileb, Samia; Mæhlum, Petter; Lison, Pierre (Chapter, 2023)
      Dialectal variation is present in many human languages and is attracting a growing interest in NLP. Most previous work concentrated on either (1) classifying dialectal varieties at the document or sentence level or (2) ...